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Abstract 

Background: Retrotransposons have been extensively studied in plants and animals and have been shown to 
have an impact on human genome dynamics and evolution. Their ability to move within genomes gives 
retrotransposons to affect genome instability. 

Methods: we examined the polymorphic inserted /\/i/Ya5, evolutionary young Alu, in the progesterone receptor 
gene to determine the effects of Alu insertion on molecular environment. We used mono-allelic inserted cell lines 
which carry both Alu-present and Alu-absent alleles. To determine the epigenetic change and gene expression, we 
performed restriction enzyme digestion, Pyrosequencing, and Chromatin Immunoprecipitation. 

Results: We observed that the polymorphic insertion of evolutionally young Alu causes increasing levels of DNA 
methylation in the surrounding genomic area and generates inactive histone tail modifications. Consequently the 
Alu insertion deleteriously inactivates the neighboring gene expression. 

Conclusion: The mono-allelic Alu insertion cell line clearly showed that polymorphic inserted repetitive elements 
cause the inactivation of neighboring gene expression, bringing aberrant epigenetic changes. 
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Background 

Retrotransposons have been extensively studied in plants 
and animals and have been shown to have an impact on 
human genome dynamics and evolution. About 42% of 
the human genome contains retrotransposons while DNA 
transposons account for around 2-3% [1-3]. According to 
the 2001 analysis, which has been confirmed overall by the 
2004 update (International Human Genome Sequencing 
Consortium 2004), short interspersed elements (SINEs), 
such as Alu or SINE-R/VNTR/A/w (SVA), account for 
13%, Long interspersed elements [LINE-l(Ll)] for 20%, 
and long-terminal repeat (LTR) retrotransposons, such as 
endogenous retrovirus (ERV), for 8%, respectively, of the 
sequenced human genome. The retrotransposons increase 
their copy number by retrotransposition via RNA. 
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Attempted or successful retrotranspositions carry a high 
risk of eliciting chromosome breaks, deletions, transloca- 
tions, and recombinations [4]. It is estimated that there is 
one Alu retrotransposon insertion every 21 births [5] dur- 
ing gametogenesis, transferring the retrotransposon's 
genetic information to the next generation [6,7]. These 
retrotransposition events are likely to change the activity 
of genes at the insertion site, including increased or 
decreased transcriptional activity. In some cases, this 
alteration of gene expression causes the development of 
several diseases or cancers [8]. DNA methylation on the 
retrotransposon is thought to be the mechanism that con- 
trols the retrotransposition rate. Recent vast numbers of 
publications uniformly address that complex disease, can- 
cer, aging, and environmental challenges are associated 
with aberrant retrotransposon DNA methylation. 

In fact, not all retrotransposons have the capability to 
retrotranspose to other genomic locations. Currently, 
most Lis are inactive and cannot retrotranspose to new 
genomic locations [9], while a small number of human 
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specific LI (L1HS) elements remain retrotransposition 
competent. Retrotransposons seeded in the human gen- 
ome several million years ago and have many subfami- 
lies defined by distinct patterns of diagnostic base 
substitutions. Subfamilies may be classified as young, 
intermediate or old, reflecting the time since the start of 
retroposition by their members. The expansion of Alu 
subfamilies (Ycl, Ya5, Ya2, Yb9, Yb8, Y, Sgl, Sx, and J; 
young to old, respectively) is superimposed on primate 
evolution. The evolutionary young LI, Alu, and SVA 
are currently able to transpose in the human genome, 
hence the ongoing retrotranspositional insertions of the 
youngest subfamilies are not yet fixed in the human 
genome and represent polymorphic loci [10]. Some 
polymorphic insertions are known to be responsible for 
more than 30 human genetic diseases [11-13]. A genetic 
polymorphism names as PROGINS has been identified in 
the progesterone receptor (PGR) gene with insertion of 
Alu subfamily [14]. The correlations of Alu insertion 
polymorphism on PGR gene are associated with endome- 
triosis [15,16], ovarian cancer with diethylstilbestrol 
exposure [17], breast cancer [18], and obesity [19]. Inser- 
tional polymorphic retrotransposons are often observed 
in a mono-allelic fashion, meaning retrotransposons are 
inserted into only one of the alleles in individuals. For 
instance, in chromosome 11, the PGR gene has a newly 
inserted A/wYa5 subfamily between exon 5 and 6. In this 
study, we examine DNA methylation and histone modifi- 
cation of the locus which occurred mono-allelic young 
Alu, AluY&5 insertions and address the direct effect of 
retrotransposon in controlling gene expression. 

Methods 

Nucleic acid isolation and bisulfite treatment 

Genomic DNA was isolated by standard proteinase K 
digestion and phenol-chloroform extraction [20]. Total 
RNA was collected and extracted from cultured cells 
with the RNeasy Protect minikit (QIAGEN Inc., Valencia, 
CA) according to the manufacturer's recommended pro- 
tocol. Reverse transcription was performed by using the 
first strand cDNA synthesis kit (NEB, Beverly, MA, 
USA). Bisulfite modification of genomic DNA has been 
described previously [21]. PCR primer sequences for Alu 
polymorphism with genomic DNA were forward: 
TTGAGTAAAGCCTCTAAAAT and reverse: TTCTTG 
CTAAATGTCTGTT, and with bisulfite DNA were for- 
ward: GAAATTTGAAGGAAATAAATATTAGTGT 
and reverse: CATTTAATTATCCAAAAATATTTTCT- 
TAC TAA. 

Quantitation of allele-specific gene expression by 
Pyrosequencing 

PCR products from genomic DNA or cDNA were used for 
Pyrosequencing analysis as previously described [21]. 



Briefly, the PCR product of each gene was used for indivi- 
dual sequencing reactions. Streptavidin-Sepharose beads 
(Amersham Biosciences) and a Vacuum Prep Tool (Bio- 
tage AB) were used to purify the single-stranded biotiny- 
lated PCR products according to the manufacturer's 
recommendation. The appropriate sequencing primer was 
annealed to the purified PCR product and used for a Pyro- 
sequencing reaction using the PSQ 96HS system (Biotage). 
Raw data were analyzed with the allele quantitation algo- 
rithm using the PSQ 96 HS software. PCR primer 
sequences for Alu polymorphism by Pyrosequencing were 
forward: TTTTCGAAACTTACATATTGA, reverse biotin 
labeled: TTTAGTATTAGATCAGGTGC, and sequencing 
primer: GATCCTACAAACA. For allele-specific expres- 
sion, forward primers: TAGTCAAGTGGTCTAAAT- 
CATTGC, reverse biotin labeled: TTTAGTATTAGA 
TCAGGTGC, and sequencing primer: GATCCTA- 
CAAACA. To validate DNA methylation detection by 
Pyrosequencing, we designed control oligo for 100% DNA 
methylation (PSQ-C oligo: 5'- TATTAGATCGACGG- 
GAACAAACGTTGAATTC -3') and 0% DNA methyla- 
tion (PSQ-T oligo: 5'- TATTAGATCAACGGGAACA 
AACGTTGAATTC -3'). The sequencing primer for con- 
trol oligo is 5'- CAACGTTTGTTCCCGT -3\ We mixed 
PSQ-C oligo (or PSQ-T oligo) with sequencing oligo in 
PyroMark Annealing Buffer (QIAGEN Inc., Valencia, CA) 
and performed Pyrosequencing with sequencing entry 
C/TGATC. 

Chromatin Immunoprecipitation 

ChIP assays were performed as described previously [22]. 
Briefly, 25 \ig crosslinked protein-DNA complexes were 
immunoprecipitated using two different histone modifica- 
tion antibodies (H3K9ac, H3K9me3: Millipore) and eluted. 
Eluted DNA fragments were amplified by PCR. PCR pri- 
mer sequences for the multiplex PCR reaction were LI 
forward: GCCTTGCAGTTTGATCTCAG and reverse: 
GACGGGTGATTTCTGCATTT, AluYS forward: GTGG 
CTCACGCCTGTAATCCCAGC and reverse: GTCGCC 
CAGGCCGGACTGCG, and Alu] forward: TGGCTC 
ACGCCTGTAATCCCAG and reverse: GCCTCGA 
CCTCCCGGGCTCAAGCG. Analyzing density of gel 
bands was performed using ImageJ which is a public 
domain Java image processing program (http://rsb.info. 
nih.gov/ij/). 

Results 

Screening of AluYa5 insertional polymorphisms in cell 
lines 

To find insertional polymorphic retrotransposons, we 
screened Raji, Jurkat, HT15, H1299, MCF, and K562 cell 
lines using the primer sets listed in the Methods section. 
The primers flanked the newly inserted retrotransposon 
A/wYa5 in chrll:100,911,358-100,912,065 locus (Assembly: 
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hgl9), thus presence of Alu insertion could be distin- 
guished by length of PCR amplicon. The PCR amplicon 
with fully inserted Alu generates a 476 bp product, while 
the amplicon without Alu insertion produces a 150 bp pro- 
duct. Among the cell lines we tested, HT15 and H1299 
showed two different sizes of bands after PCR amplifica- 
tion, indicating Alu has inserted in only one allele of the 
genome locus (Figure 1). MCF and K562 showed insertion 
of Alu into both alleles (476 bp products). Raji and Jurkat 
cell lines, however, did not carry an Alu insertion in either 
allele (150 bp products). 

Alu insertion dependent DNA cytosine methylation 

In order to examine the retrotransposon-derived DNA 
methylation spreading theory [23], we determined DNA 
methylation status on the A/w-present and A/w-absent 
alleles, using the mono-allelic inserted cell lines HT15 and 
H1299. The PCR amplicon with bisulfite treated DNA was 
digested with the restriction enzyme HpyCH4III, which 



cut the 5 , ..ACNGT..3' region located on the PCR amplicon 
sequence in only the methylated allele (Figure 2A). Both 
mono-allelic Alu inserted cell lines, HT15 and H1299, 
showed partial digestion of only the A/w-present allele, 
indicating DNA methylation exists in only the Alu inserted 
allele (Figure 2B). The Alu inserted allele in the H1299 cell 
line showed slightly more methylation than the Alu 
inserted allele in the HT15 cell line (Figure 2C). We did 
not observe digestion of the A/w-absent allele. 

Alu insertion derived inactive histone modification 

To determine whether Alu insertion causes histone tail 
modifications, we performed ChlP-PCR with two histone 
modification antibodies against H3K9ac or H3K9me3. 
Acetylation at Lys-9 on histone H3 (H3K9ac) is an active 
chromatin marker and often associated with positive 
gene expression; conversely, methylation at Lys-9 on his- 
tone H3 (H3K9me3) is an inactive chromatin marker and 
correlated with repressed gene expression [24]. After 
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Figure 1 Schematic diagram and genotyping of AluYa5 insertion. A) Schematic of mono-allelic AluYa5 insertion in Chromosome 11. The 
size of /\/u-present allele is 476 bp and /A/u-absent allele is 150 bp. The red line is AluYa5 inserted in human genome. Black circles represent CpG 
sites. The location of PCR primers are represented in blue arrows. B) The genotyping of Alu insertion. The genomic DNA from six cell lines, Raji, 
Jurkat, HT15, H1299, MCF, and K562, are amplified with PCR primer marked above. Single 150 bp bands are generated from bi-allelic /A/u-absent 
cell lines, 150 bp and 476 bp are from mono-allelic inserted cell lines, and single 476 bp are produced from bi-allelic inserted cell line. 
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Figure 2 Alu insertion dependent restriction enzyme digestion. A) Predicted size and cutting site after restriction enzyme digestion. B) 

Agarose gel electrophoresis and C) density of gel bands. Bisulfite-PCR products from mono-allelic Alu inserted cell lines, HT15 and HI 299, are 

digested with HpyCH4lll restriction enzyme. 'Uncut' represent a pre-digestion of PCR products which show the intact /\/u-present (orange line 

and arrow heads) and /\/u-absent (dark blue line and arrow heads). 'HpyCH4W' represent a post-digestion of PCR products which generate 72 bp 

bands (light blue lines and arrow heads) and either 404 bp (green lines and arrow heads) or 78 bp (red lines and arrow heads), depends on the 

DNA methylation status in the CpG site. The density of the gel band from /\/u-present allele was measured. 
^ J 



chromatin immunoprecipitation with the two antibodies 
for active or repressive histone markers, followed by PCR 
amplification, we observed differential histone modifica- 
tion between A/w-present and A/w-absent alleles. The 
active marker H3K9ac is present in only the Alu-absent 
allele; however, the inactive histone marker H3K9me3 
exists in both allele of the genome locus (Figure 3). This 
difference in histone modification has only happened in 
young Alu subfamilies, not all Alu subfamilies. ChIP 
coupled with PCR amplification of Alu], AluYbS, and 
L1HS showed different distributions of histone modifica- 
tions. Alu], the oldest Alu subfamily, co-located with 
both the active marker H3K9ac and the inactive marker 
H3K9me3. However, the young Alu subfamily A/wYb8 
had at least eight times more inactive histone marker 
H3K9me3. In addition, human-specific L1HS did not 
show a different distribution of active or inactive histone 
markers (Figure 4). 

Gene expression repressed by Alu insertion in the 
genome 

To examine differential gene expression in A/w-present 
and A/w-absent allele, we developed an allele-specific 
gene expression detection method using Pyrosequen- 
cing. To distinguish between the two alleles, we geno- 
typed the single nucleotide polymorphism (SNP) at 
chrll:100921952-100922452 (2009 (GRCh37/hgl9) 
assembly), reference SNP ID number is rsl042839, since 



this SNP is correlated with occurrence of Alu insertion 
[25,26]. To confirm this co-existence, we genotyped this 
SNP in the six cell lines we worked with and compared 
with their Alu insertion statuses (Table 1). Hetero Alu 
inserted cell lines HT15 and H1299 showed heterozy- 
gote C/T, A/w-absent cell lines Hep3B2 and HL-60 had 
a C/C genotype, and A/w-present cell lines MCF and 
K562 had a T/T genotype. We confirmed that the 
T allele co-exists with Alu insertion, while the C allele 
co-exists with the absence of Alu insertion in hetero 
A/w-inserted cell lines. Next, we used this SNP to iden- 
tify A/w-inserted alleles for allele-specific gene expres- 
sion detection in a Pyrosequencing reaction. After 
reverse transcription-PCR with mRNA from the hetero 
A/w-inserted cell line H1299, we amplified the locus 
flanking the SNP to detect each allelic gene expression 
level (Figure 5). Surprisingly, we observed unequal gene 
expression levels between A/w-present and A/w-absent 
alleles, 10.5% and 89.5% respectively, having an equal 
distribution of both alleles in the genome (46.7% of Alu- 
present allele vs 53.3% of A/w-absent alleles with geno- 
typing data). Thus the presence of Alu in the gene body 
repressed gene expression at the allele containing the 
Alu element. 

Discussion 

We examined the polymorphic inserted young Alu, 
AluY&5, in the PGR gene to determine the effects of Alu 
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Figure 3 Chromatin Immunoprecipitation assay to assess differential histone modifications on >A/u-absent or >A/u-present allele 

Representative gel showing chromatin preparations from immunoprecipitated with anti-IgG (lanes 4), anti-H3K9ac (lane 5), anti-H3K9me3 (lane 
6), or the negative control non-immune serum (lanes 3). Lane 2 is non-immunoprecipitated DNA. Lane 1 is DNA size marker (A). The density of 
the gel band from Alu-present and /A/u-absent allele was measured (B). '+ Alu' is the /\/u-present allele and '- Alu' represent the /A/u-absent allele. 



insertion on the near gene environment. We used 
mono-allelic inserted cell lines which carry both Alu- 
present and A/w-absent alleles. We observed that the 
polymorphic insertion of evolutionally young Alu causes 
increasing levels of DNA methylation in the surrounding 
genomic area and generates inactive histone tail modifi- 
cations. Consequently, the Alu insertion deleteriously 
inactivates the neighboring gene expression (Figure 6). 

It is a novel approach to address the cis-effects of retro- 
transposons or retrotransposition in neighboring geno- 
mic structures using a mono-allelic inserted young Alu 
subfamily. These effects were observed in a single cell 
line system, and virtually all conditions at the particular 
locus are the same; the only difference being the presence 
or absence of a retrotransposon insertion. Thus this 



system bypasses many concerns about experimental arti- 
facts being solely responsible for deducing the function 
of retrotransposons in the genome. 

Generally, our results agree with previous reports that 
retrotransposons may repress gene expression through 
an epigenetic mechanism. Our study strongly supports 
the observations that young active retrotransposons 
insert in areas that lack cytosine methylation. Retrotran- 
sposons spread DNA methylation into neighboring 
regions, generating repressive histone modifications. It 
causes a significant inactivation of gene expression. Holl- 
ister et al. reported the correlation of transposable ele- 
ments and gene silencing; however the caveat was that 
the data do not show whether repetitive elements tend to 
preferentially insert near lowly expressed genes or 
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Figure 4 Chromatin Immunoprecipitation assay to assess differential histone modifications of global LI HS, young Alu; AluYb8 and old 
Alu; AluJ. Representative gel showing chromatin preparations from immunoprecipitated with anti-IgG (lanes 4), anti-H3K9ac (lane 5), anti- 
H3K9me3 (lane 6), or the negative control non-immune serum (lanes 3). Lane 2 is non-immunoprecipitated DNA. Lane 1 is DNA size marker (A). 
The density of the gel band from L1 HS, AluYbS, and AluJ was measured (B). 



whether the insertion of repetitive elements causes the 
low gene expression [27]. However, our mono-allelic 
inserted cell line system clearly showed that repetitive 

Table 1 Alu insertion and genotyping in cell lines 
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SNP 
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elements cause the inactivation of neighboring gene 
expression. 

It has been estimated that approximately one out of 
every 21 births, 212 births, and 916 births has a new inser- 
tion of Alu, LI, and SVA retrotransposition, respectively 
[10]. Thus there is a great deal of retrotransposition in the 
current human genome. It has been know that evolution- 
ally young repetitive elements have the capability to retro- 
transpose to other genomic locations. In our study, the 
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Figure 5 Differential level of PGR gene expression on >4/u-absent or -present allele. A) PCR combined with Pyrosequencing shows similar 
proportion of C (/4/u-absent; 53.3%) and T allele (/A/u-present; 46.7%) in genomic DNA. Reverse transcription-PCR coupled with Pyrosequencing 
shows different proportion of C (/A/u-absent; 89.5%) and T allele (/A/u-present; 10.5%). B) Representative program of genotyping and gene 
expression. 




Figure 6 Summary of molecular signature on mono-allelic Alu 
insertion. /\/u-present allele has cytosine methylation in neighbor 
CpG and repressive histone modification. /\/i/-absent allele shows no 
CpG methylation in near CpG site and has additionally active 
histone modification, hence show active gene expression. 



inactive histone modification solely existed in the young 
Alu subfamily and disappeared in the old Alu J subfamily. 
Coincidently, global Alu J has less DNA methylation than 
the young and active Alu Y family (data not shown). 
Hence this epigenetic difference may promote the 
mechanism that facilitates transposon mobility. However, 
what triggers this phenomenon is still not clear, though 
environmental cues are believed to be responsible for pro- 
moting movement of DNA transposons and retrotranspo- 
sons. DNA methylation on retrotransposons is thought to 
be an intermediate of the retrotransposition mechanism. 
We have observed aberrant cytosine methylation patterns 
on retrotransposons with environmental challenges, but 
do not know what causes these events or what conse- 
quences follow them. Based on the observation of our 
data, aberrant cytosine methylation on retrotransposons 
caused by environmental challenges may trigger retrotran- 
sposon mobility, slowly reshaping human genome. In the 
future, it will be necessary to understand the function of 
other types of retrotransposons of different ages in order 
to finally resolve the meaning of this aberrant epigenetic 
phenomenon driven by environmental challenge. 
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Conclusions 

The mono-allelic Alu insertion cell line clearly showed 
that polymorphic inserted repetitive elements cause the 
inactivation of neighboring gene expression, bringing 
aberrant epigenetic changes. 
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